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THE QUADRIFOCAL VARIETY 


LUKE OEDING 

Abstract. Multi-view Geometry is reviewed from an Algebraic Geometry perspective and 
multi-focal tensors are constructed as equivariant projections of the Grassmannian. A con¬ 
nection to the principal minor assignment problem is made by considering several flatlander 
cameras. The ideal of the quadrifocal variety is computed up to degree 8 (and partially 
in degree 9) using the representations of GL(3) x4 in the polynomial ring on the space of 
3 x 3 x 3 x 3 tensors. Further representation-theoretic analysis gives a lower bound for the 
number of minimal generators. 


1. Introduction and background 

1.1. The multi-view variety. Multi-view Geometry is a branch of Computer Vision [22]. 
An important task in Computer Vision is to efficiently reconstruct the 3-dimensional scene 
from the 2-dimensional projections. Typically, one first estimates the multi-focal tensor 
associated to the n views using correspondences arising from one object seen in multiple 
images. From the multi-focal tensor one reconstructs the camera matrices. After the camera 
matrices are known, one uses the point correspondences to triangulate the 3D points. 

In the standard pinhole camera model, the projection of 3D world points to multiple 2D 
images is represented by a collection of 3 x 4 matrices (A,. .., A n ) and the mapping 

, , P 3 -A P 2 x • • • x P 2 

[x] i-A ([Ax],..., [A n x]). 

For cameras in general position the multi-view mapping ([T]) defines a 3 dimensional subvariety 
of the Cartesian product of projective spaces called the multi-view variety. Aholt, Sturmfels, 
and Thomas demonstrated rich algebraic geometry arising from this construction in [5], 
They utilized a certain Hilbert scheme to describe the multi-view variety, its defining ideal, 
and further algebraic properties. Their theoretical techniques included Borel fixed monomial 
ideals, a universal Grobner basis, degeneration to a special monomial ideal, and more. This 
work catalyzed a new area coined “Algebraic Vision” by Sameer Agarwal and Rekha Thomas. 
The reader may wish to consult the following other examples of recent work in this held 

la m Ei El si m\. 

1.2. Moduli spaces and quadrifocal tensors. Suppose the entries of the camera matrices 
are not known, but are considered as parameters. By moding out by the projective rescaling 
in each camera plane we have a moduli space of camera matrices. The algebraic varieties of 
multi-focal tensors are models for these moduli spaces, and we want to know their basic alge¬ 
braic properties. Bifocal tensors are just 3x3 matrices of rank 2, defined by the 3x3 determi¬ 
nant. Chris Aholt and the author resolved the long-standing open question of describing the 
ideal of trifocal tensors [4], building on work of Alzati and Tortora [6]. They also computed 
its algebraic degree and Hilbert polynomial using Maple, Macaulay2 P2], and Bertini [7]. 
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In this paper we will be mainly concerned with the quadrifocal variety. One may record the 
correspondences induced from one 3D point seen in 4 images by the 81 special 4x4 minors of 
stacked camera matrix A = (Af[ \Aj \ ■ ■ ■ | A][) that only use one column from each of the first 
four blocks of A. These coordinates are linear in each block, yielding a3x3x3x3 quadrifocal 
tensor. The quadrifocal variety is the Zariski closure in P 80 of the set of quadrifocal tensors. 

We seek a complete description of the polynomial defining equations of the quadrifocal 
variety. The main result of the present article is a first step in this direction. 

Theorem* 1.1. Let Id denote the degree d piece of the ideal of the quadrifocal variety. 

Id is zero for d < 3. 

I 3 is 600-dimensional. 

I 4 . is 48, 600 -dimensional but contains no minimal generators. 

I 5 is 1,993, 977 -dimensional and contains at least 1,377 minimal generators. 

Iq is 54,890,407 -dimensional and contains at least 37,586 minimal generators. 

If is 1,140, 730,128 -dimensional and likely contains no minimal generators. 

J 8 is 18,940,147,947 -dimensional and contains at least 162,000 minimal generators. 

I 9 is > 223, 072, 284,455 -dimensional and contains at least 3, 087, 000 minimal generators. 

The star refers to the fact that some of our computations were done using random points 
of the quadrifocal variety, so the dimensions reported are only upper bounds, but the lower 
bounds hold with high probability. I 11 Section Owe give an invariant description of all these 
equations and computational evidence that the equations reported here are likely not a 
minimal set of generators. 

Until now, it was only known that a quadrifocal tensor must adhere to 51 non-linear 
constraints m ■ Indeed, the quadrifocal variety has codimension 51, so there must be at least 
51 equations, but our results show that it is very far from being a complete intersection. For 
instance, there are 600 cubic minimal generators. In Section [5] we give a simple description 
of these equations via contractions. From the contraction description we see that the cubic 
equations are a consequence of the fact that every contraction of a quadrifocal tensor is a 
homography tensor [46]. Additional equations are needed to take the set of tensors having 
that property alone and cut it down to the quadrifocal locus. 

1.3. Multi-focal tensors in general. If tt is a partition of 4, the entries of a multi-focal 
tensor of profile 7r are given by the minors of A that use 7 g columns from block i. Multi-focal 
tensors record correspondences between multiple images. An epipole is the image of one 
camera’s focal point seen in another view. Correspondences between pairs of image points 
(in 2 views) leads to the bi-focal tensor, or fundamental matrix. Whereas a point-point- 
line correspondences (in 3 views) are encoded by a trifocal tensor [15], and correspondences 
of quadruples of image points (in 4 views) are encoded by a quadrifocal tensor [47]. See 
[20, 2F; [26, 52] 09] for applications of quadrifocal tensors. 

Each multi-focal tensor can be determined by observing some minimal number of corre¬ 
spondences in multiple images. From 7 point correspondences in 2 images one can reconstruct 
the bifocal tensor (fundamental matrix). 5 point correspondences in 3 or 4 images suffice to 
determine the trifocal and quadrifocal tensors [221 Parts III&IV]. For a summary of these 
and other minimal problems in Computer Vision, see [42], and in particular one can check 
W\ (33 HH £3 m\ for recent algorithms for the relative pose problems in 3 and 4 views. 

Since algorithms for determining the relative pose of 2 cameras exist (for instance [34]) 
one may ask what is the advantage of considering more than 2 views at a time. The first 
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advantage is that fewer 3D points need to be identified for more than 2 views, which can be 
useful when many points in one view become outliers for another. In the 3 view case, the 
trifocal tensor can be used in the structure from motion problem when the cameras move 
along a straight line (see PH EH EH). Other advantages include greater stability, and the 
possibility to avoid certain unstable or critical configurations. In the 4 view case, while the 
quadrifocal tensor is more difficult to construct, one may avoid iterative algorithms which 
are not guaranteed to converge [22J Ch. 1], Moreover, unlike in the 3 view case where one 
view plays a special role, in the 4 view case all 4 images can play the same role, and it is 
expected that this symmetry can be exploited. For 3D to 2D projections and 4 or more 
views there are no n-focal tensors if n > 4, hence we only consider the 2, 3 and 4 view cases. 

1.4. Outline. For the reader’s convenience we collect notation in Section [21 We give a 
uniform presentation of multi-focal tensors from the viewpoint of Algebraic Geometry via 
equivariant projections of a Grassmannian in Section [3l which connects our work to [141 20 ], 
Section [4] addresses the case of different dimensional cameras and contains a connection 
between the multi-focal variety and the variety of principal minors of square matrices, con¬ 
necting this work to [33]. We discuss contractions and homography tensors in Section [5l and 
state Proposition 15.31 describing the 600 bihomogeneous cubics that are the minimal genera¬ 
tors of the quadrifocal ideal in the lowest degree. We use symmetry-enhanced calculations to 
determine the quadrifocal ideal up to degree 8 and partially compute the ideal in degree 9 in 
Section [ 6 ] In addition, we use representation theory and computations utilizing SchurRings 
to determine necessary minimal generators of the quadrifocal ideal. We conjecture that these 
necessary minimal generators also suffice. 

2. Notation 

Let U and D denote vector spaces, which we always consider the complex numbers, denoted 
C, to be our ground field. The direct sum of U and D is denoted U © D, and their tensor 
product is denoted U <g)D. The fc-th exterior power of D, or the fc-mode skew-symmetric 
tensors is denoted f\V. Its elements are linear combinations of A;-fold wedge products 
V\ A v 2 A • ■ ■ A v n with v t G D. We discuss alternating tensors in more depth in Section 13.2.II 
The vector space of symmetric d-mode tensors is denoted S d V. If we choose a basis x\,... ,x m 
of D we may consider S d V as the vector space of homogeneous degree d polynomials on 
x\,..., x m . The symmetric algebra on D is denoted Sym'V = 0 rf>o S d V, and is isomorphic 
to the polynomial ring C[aq,..., x n \. The tensor algebra on D is denoted D® = 0 d>o D® d . 

The vector space dual, denoted D*, is the space of all linear functionals V —> C. After 
bases are chosen for U and V, U*<S>V may be thought of as the space of matrices representing 
linear mappings U —* D. If M G U*®V is a matrix /\ A M G /\ fc I/*<g)/\ A D may be thought of 
as the matrix whose entries are the k x k minors of M. The general (special) linear group of 
all invertible (determinant 1) linear transformations of D is denoted GL(D) (resp. SL(D)). 

We use the notation tt b d to denote a partition i r = (pi,p 2 ,... ,p n ) with Y2iPi = d- 
We let S n V to denote the corresponding Schur module, which we think of as an explicit 
representative of an irreducible submodule of the d-fold tensor product D® d . Given vector 
spaces Vi, V 2 , V 3 , V 4 and multi-partition 7 r = (viy, 7 r 2 , 7 t 3 , 7 t 4 ) with tt b d we use the shortened 
notations S n V and S , 7 ri S , ^- 2 5' 7 r 3 5 ' 7r4 for S^ l Vi®S^ 2 V 2 ®S^ 3 V^®S ni V 4 ,. 

The projective space of all lines through the origin in V is denoted PD. If v G V is 
nonzero the line through v is denoted [u]. If X C P U and Y C PD are algebraic varieties 
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their Cartesian product, denoted A" x Y is a subvariety of P (U © V ). The cone over the 
projective X C PC, denoted A" is an affine subvariety of V. 

Multifocal tensors sometimes need a mixture of indices and multi-indices. For instance, 
fundamental matrices are indexed by a pair of double indices: the notation (Fpjip. n) 
indicates a matrix with rows indexed by the double index {i,j}, and columns indexed by 
the double index {k,l}, while {Tij^k,i}) denotes a 3-mode tensor with the first two modes 
indexed by i and j respectively, with the third mode indexed by the double index {k, l}. 

3. Epipoles, fundamental matrices, trifocal and quadrifocal tensors 

3.1. The multi-view setup. Multiple view geometry arises when one considers many im¬ 
ages taken of the same scene, from (possibly) different viewpoints and is beautifully presented 
in [22] , The following introduction is an invariant view inspired by the ideas in [22, Ch. 17]. 
Let Aj denote 3x4 camera matrices (non-degenerate) for 1 < j < n, with row spaces equal 
to Vj, a three-dimensional vector space. 

Let W denote a 4-dimensional vector space, whose projectivization represents the 3- 
dimensional “projective world”. For fixed camera matrices Aj, the multi view map (which 
also appeared in [5]) is 

(2) FW —. ln> - PVi x • • • x FV n . 

Now we wish to treat the camera matrices as variable or as having indeterminate entries. 
The map in (J2]) is the same if we replace the matrices Aj with scalar multiples of themselves. 
So, we should consider our space of parameters for cameras to be 

P(lTWi) x ■ ■ • x P (W*®V n ) n -camera space. 

We note here that if different camera models are taken, this paradigm may be easily altered 
to accommodate such changes by altering the spaces in which the cameras are modeled. 

3.2. The Grassmannian. Faugeras and Mourrain studied multi-view geometry from the 
point of view of the Grassmann algebra, see pi eq]. We also adapt that approach as it 
provides a uniform treatment and a convenient way to organize many of our computations. 

3.2.1. Exterior products. Recall if V is a vector space with basis u\,... ,u n , we construct 
the exterior powers of U, denoted /\ fc U by considering the alternating (or wedge) product A 
and forming the vector space of fc-vectors (length k wedge products) with basis consisting 
of pure k -vectors {u^,... ,Ui k \ 1 < i\ < i 2 < ■ ■ ■ < ik < n}- Thus f\U has dimension (“) if 
0 < k < n and 0 otherwise. It is straightforward to see that v\ A • • • A ^ 0 if and only 
if the vectors {ui,..., Vk} are linearly independent in U. Consider two non-zero fc-vectors 
Vi A • • • A Vk and W\ A • • • A Wk and the underlying vector spaces E := span{rq,..., Vk} and 
F := span-fuT,..., w fc }. It is straightforward to check that 

Vi A • • ■ A v k — \(wi A • • • A Wk) for some A ^ 0 if and only if f\ E = f\ F, 

and equality holds when A is the determinant of the change of basis between E and F. 

We denote by Ff^U the projective space consisting of lines through the origin in /\ \U , 

which we may consider as the set of classes [a;] = < Xoj \ A G C \ {0},a; e /\ fc f/ \ {0} >. 
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This motivates the definition of the Grassmannian (in its minimal embedding). Let 
Gr (k, U ) denote the set of fc-planes in U. The rational map: 

Gr (k,U) —>• PA k U 
M i —y i\M 

is an embedding (in fact, the embedding is a minimal rational embedding). The usual Pliicker 
embedding is a slight variant of this construction, which we will review next in the context 
of multiple view geometry. 

3.2.2. From multiple views to the Grassmannian. It is natural to consider the following 4 x 3rt 
blocked matrix, which will present a convenient way to keep track of external constraints on 
the multi-view setup 

M=(AJ\AJ\... | A T n )e {W*®V_ i) © ■ • • © (W*®V n ) = W*®(Q j V)). 

The non-degeneracy condition is that each matrix Aj in an n-tuple ([A) 1 "], [Aj],..., [.Aj]) 
must have full rank, which occurs in an open set; so the row space of M parameterizes the 
(Grassmannian variety) 4-dimensional subspaces of a 3n-climensional space. 

The maximal minors of M give coordinates (the Pliicker coordinates) on the Grassmannian 
Gr(4,3n). These minors are also known as multilinear coordinates. 

In invariant language this parameterization is the following map 

,,, ^w'*®(©,v) —*■ AV*®A 4 (©,n)^A 1 (© i U) 

1 ; Ml —» t\M 

The image of p (the Zariski closure of the image of an open set) is isomorphic to the cone 
over the Grassmannian of 4-dimensional subspaces of 0.1^-, which is the row space of M. 
In other words 

MvO = 61(4, ©, v s ) c A J (©, U). 

Notice that because W is 4-dimensional, f\ W is one-dimensional, and thus passing to the 
maximal minors of the concatenated matrix M removes the dependency on the world points 
represented by PIT. 

It is well known that the dimension of the Grassmannian Gr(r,C N ) is r(N — r). In our 
example dim(Gr(4, 3 n)) = 4(3 n — 4). 

If we restrict to camera space, we have to consider the image of the map up to the n- 
dimensional torus action which records the projective ambiguity in each of the n cameras. 
We can restrict the target of the map to the appropriate GIT quotient'. 

P(IWWi) x • • • x P(WWn) -A Gr{ 4, 0. y;-)//(C*) n C A 4 (®, TA//(C*) n 

From this we obtain the dimension of the GIT quotient (see [221 § 17.5] and [5j § 6 ]) 

dimG^(4,0I/ i )//(C*) n = 4(3n - 4) + 1 - n = lira-15. 

3 

Remark 3.1. Here is a classical formula for the degree of the Grassmann manifold, (see [35]. 
[I 6 l Ex. 14.7.11] or [EH § 10 . 1 . 2 , eq. ( 10 . 6 )]) 

1 ! 2 ! • • • r! dim(Gr(r, n))! 

(n — r)!(ra — r + 1 )! • • • n\ 


deg Gr (r, n) 
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For example 

deg(Gr(4, 6)) = 14, deg(Gr(4, 9)) = 1662804, deg(Gr(4,12)) = 1489877926680. 

One hope is that a better understanding of the GIT quotient of the Grassmannian might 
allow us to find the degree of the multi-focal tensor varieties. 

3.3. Symmetry. For j e {1,2, 3,4} let Aj be a 3 x 4 (non-degenerate) camera matrix (an 
element of W*®Vj), with blocking Aj = ( Bj\x.j ). On each matrix Aj we have an action of 
GL(Vj) = GL(3) acting by change of coordinates in the camera plane. The action is 

(GL(Vi) x GL(G 2 ) x GL(G 3 ) x GL(G 4 )) x ((WWi) © (W*®V 2 ) © (WW 3 ) © (W*®V 4 )) 

->• ((iwwi) © {w*®v 2 ) © (ww 3 ) © (vfw 4 )) 

(pi? 92 , g 3 , g^), (A\, A 2 , A 3 , Aj) h> (giAi, g 2 A 2 , g 3 7l 3 , g^A^), 

where we take the action of each gj to be a change of basis in the row space of Aj. Because 
the matrices Aj are assumed to be full rank, we can, without loss of generality, act by an 
element of GL(3) x4 and assume that Bj = Id 3 and move the 4-tuple (Tl^|707l{~|7lJ) to 


/ 1 

0 

0 

1 

0 

0 

1 

0 

0 

1 

0 

0 

1 0 

i 

0 

0 

1 

0 

0 

1 

0 

0 

1 

0 

l 0 

0 

1 

0 

0 

1 

0 

0 

1 

0 

0 

1 

V ®i,i 

* 1.2 

* 1,3 

* 2,1 

* 2,2 

* 2,3 

* 3,1 

* 3,2 

* 3,3 

* 4,1 

* 4,2 

* 4,3 


Remark 3.2. There is an action of GL(PF) acting on simultaneously on all of the column 
spaces of Aj (which are all equal to W). While this action doesn’t turn out to be useful (it is 
trivialized in the tensorial map), the action of © 4 permuting the matrices Aj , also permutes 
the indices in the image of the tensorial map, and preserves the set of quadrifocal tensors. 

3.4. Multilinearity spaces. The 4x4 minors of M come in several classes, which have 
invariant descriptions. By construction, /\ 4 (0 ; Vj) is a vector space with a natural GL(3n)- 
action, but we can further view this as a GL(/\ 1 (0 / V)))-action (each GL(V)) acting by 
invertible linear change of coordinates in Vj), and there is a natural inclusion of G := 
n, GL(Vj), which may be thought of as block diagonal (with the proper choice of basis) 
inside GL(3n). Moreover, we may view G as a product of a torus T n := WjTj — (C*) n 
(Tj = C* acting by scaling block j of M) and {{j SL(V^). In summary 

G FI 7 xIF L «> c GL(A“0U). 

3 3 

Thus, we may consider Gr(4, 0 V)) as a G- variety. On the other hand, on the GIT 
quotient, the torus action is trivialized, so we will consider G' : = asm/,) acting on 

Gr(4, 0 Vj) C A 4 ® Vj and on the GIT quotient Gr(A, 0. Vj)//(C*) n C A 4 (©^')//(C*) n . 
The effect of trivializing the torus action is that we may identify every Vj with its dual, and 
this induces an identification of every irreducible representation S^Vj with its dual S^Vj- 
Now G acts on A i (0 / - Vj) an d it has a decomposition into irreducible G-modules: 

A 4 (©., Vi) = (® w AV.®n) e (® (<j AVi® A 2 n) 

® (©«{j,»w< k AV,®Kj®H) ® (©. r ■ 

The 4 non-isomorphic module classes and their descriptions are listed in Table [H 

Now we will consider the projection of the cone over the Grassmannian GV(4, 0. Vj) to 
each type of multi-linearity space. The images of the projections are respectively the single 
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Columns used 

Invariant description 

Space 

i\ i-i i-i j 

h ^2 ji ji 

k k j k 

i j k l 

Kv&Vi = Vj 
A 2 v)©A 2 L = v*®v* 

f\Vi®Vj®V k ^ V*®Vj®V k 
Vi®Vj®V k ®Vi 

epipole space 

fundamental matrix space 
trifocal space 
quadrifocal space 


Table 1. Classes of multi-focal tensors. Assume z,j, kj are distinct, i s are 
in block z, and j s are in block j. 


S = ( (xi,l~X 2 ,l) (-l)(zi,2-£2,2) (^1,3-^2,3) ) 


(1, 2 )-epipole 


0 £1,3—22,3 -£l, 2 +£ 2,2 

F — ( — £l, 3 +£ 2,3 0 21,1-22,1 

£1,2—£2,2 —£l,l+£2,l 0 


(1, 2 )-fundamental matrix 


£l,iY£ 2 ,i 
£l,iT£3,i 
£ 2 ,i £3 

^i,j,{k,l} = 0 } 


k , l distinct, 
i distinct from /c&Z, 
else. 


(1, 2, 3*) -trifocal tensor 


Qi,i,k,l — ( 1) (£ 1,i £2,i) = 

Qi,k,i,l = ( 1) (£l,i £3,i) — 

Qi,k,l,i — ( 1) '(£l,i £4,i) — 

Qk,i,i,l — ( l)^(£2,i £3,i) — 

Qk,i,l,i — ( l)*(£2,i £4,i) — 

Qk,l,i,i — ( 1) (£3,i £4,i) — 

Qi,j,k,l — 0 


Qi,l,i,k j 
Qi,l,k,ii 
Ql,i,i,ki 

Ql,k,i,i 




k < l, 

i distinct from k & l, 


(1, 2, 3,4 )-quadrifocal tensor 


} else. 


Table 2. Parametrizations of multi-focal tensors up to symmetry. 


view, the epipolar variety, the trifocal variety, and the quadrifocal variety. The fiber of the 
projection over a general point is the product of the ignored camera planes and the torus 
acting on the utilized camera planes. So it is interesting to consider the minimal number of 
cameras in each case. Moreover, because the projections on the level of vector spaces are 
equivariant, the images are automatically invariant (with respect to the appropriate group). 

Suppose now that there are at most 4 cameras. For each i e {1,2,3,4} let the set 
{e\, e l 2 , e l 3 } denote a basis of Vj, which also provides an ordered basis on C 12 = V\ © V 2 © V 3 © 
V 4 . I 11 the remainder of this section we consider the parameterization of epipoles Sec. 13.51 
fundamental matrices Sec. 13.61 trifocal tensors Sec. 13.71 and quadrifocal tensors Sec. 13.81 all 
up to the G-action using the matrix A in (UJ). These results are summarized in Table [2j 

3.5. Epipoles. For a pair of cameras, we consider the projection of Gr(4, 6 ) = Gr(4, V\ © 
V 2 ) C P /\ 4 {V\ © V 2 ) = P 14 to epipolar space P(Id©/\V 2 ) = P 2 . The target space is (naturally 
isomorphic to) the projective plane, and the map subjects onto P 2 . The image is naturally 
GL(Vi) x GL(V 2 )-invariant, the action of GL(Vi) being trivial and the 2 -dimensional torus 
acts by a weight of (3,1). 

The image of the projection is the space of epipoles in view 1 imposed by view 2. Thus the 
epipoles may be recovered from the multi-view setup via a projection from the Grassmannian. 
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To get an expression of an epipolc in coordinates consider just two cameras and the matrix 


/ 1 

0 

0 

1 

0 

0 \ 

1 0 

1 

0 

0 

1 

0 1 

l 0 

0 

1 

0 

0 

1 

V x i,i 

3:1,2 

3:1,3 

3:2,1 

3:2,2 

CO 

oT 


An element S of single camera space has the following form in the Pliicker coordinates 
associated to the determinants of the matrices constructed from one column from the first 
block of A the three columns of the second block: 

S = S P,W} e l A el A A e\ G /^V 2 ®V 1 - Vj. 

l<p<3 

Applying this to A we get the coordinates of the epipole 

[ Sl,{l,2,3}(A)\ ( (xyi —X2,l) \ 

I ^ 2 ,{ 1 , 2 , 3}(^4) I = I ( —1)(^1,2 — £ 2 , 2 ) I • 

\*S , 3,{1,2,3}("4)/ \ (^1,3 - £ 2 , 3 ) / 

3.6. Fundamental matrices. Again for a pair of cameras, we may consider the projection 

of Gr(4, 6 ) = Gr(4, © V 2 ) C P/\ 4 (G] © V 2 ) = P 14 to fundamental matrix space P(= 

P 8 . The target space may be interpreted as the projectivization of a space of 3 x 3 matrices, 
and caries the natural action of GL(Vi) x GL(l^). We might call the image of the projection 
the variety of fundamental matrices, which is also naturally GL(Vi) x GL(V 2 )-invariant. 
Because the vector spaces V\ and V 2 play symmetric roles, this image is also naturally ©2 
invariant. It is well known that the matrices in the image of the projection have a one¬ 
dimensional kernel and the image variety is just the (degree 3) determinantal hypersurface. 

Note that Gr( 4,6) is 9-dimensional, the GIT quotient Gr(4, 6 )//(C *) 2 is 7-dimensional, 
and thus birational to the projective variety of 3 x 3 matrices of rank < 2, the variety of 
fundamental matrices. Also note that the 2 -dimensional torus acts by a weight of ( 2 , 2 ). 

An element F of fundamental matrix space has the following form in Pliicker coordinates: 

F = V F {iA(t , } ej A e) A e\ A e? 6 RV&pV, = UW/. 

l<ii<j, k<l 

For a fixed pair of distinct indices i,j, a fundamental matrix is gotten by applying F to a 
blocked 4 x 12 camera matrix A. In particular, F(A) may be thought of as a vector whose 
coordinates are determinants of the 4x4 submatrices of A obtained by taking two columns 
from each from blocks i and j of A. 

Now apply F to A in (14j) . The (1-2) fundamental matrix associated to A is described by 
coordinates Tpjmfc,;}, and we can represent this as the matrix 

/ 0 2'1,3 — 22,3 — 2i, 2 + 22g\ 

F(A) = -Xi ,3 + x 2)3 0 2 - 1,1 - 2 2 ,l 

\ 2 i , 2 - 2 2 ,2 - 2 yi + 2 : 2,1 0 ) 

where the (p, q ) entry is such that {p, i,j} = {q, k, 1} = {1, 2, 3}. 

3.7. The trifocal variety. For a triple of cameras, we consider the projection of Gr(4, 9) = 

Gr(4, Vx@V 2 @ V 3 ) C PA 4 (1A ® V 2 © V 3 ) = P^^ -1 to trifocal space P(W®1A 2 <gW 3 *) = P 26 . The 
target space may be interpreted as the projectivization of a space of 3 x 3 x 3 tensors, and 
caries the natural action of GL(Vi) x GL^) x GL(Vs). The image of the projection is called 
the trifocal variety, which is also naturally GL(Vi) x GL(V 2 ) x GL(V 3 )-invariant. Because 
the vector spaces V\ and V 2 play symmetric roles, this image is also naturally © 2 invariant. 
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Non-trivial permutations involving V 3 will not preserve this trifocal variety but produce an 
isomorphic copy it. 


The trifocal variety is 18-dimensional ([22, p. 368]). The GIT quotient Gr(4, 9)//(C *) 3 is 
18-dimensional and thus is birational to the trifocal variety. In this case, the 3-dimensional 
torus acts by a weight of ( 1 , 1 , 2 ). 

For the trifocal tensor, consider 3 cameras in special position producing 


A = 


1 0 0 

010 
0 0 1 

*1,1 si,2 *1,3 


1 0 0 

0 1 0 

0 0 1 

^ 2,1 * 2.2 * 2,3 


10 0 

0 1 0 

0 0 1 

* 3,1 * 3,2 * 3,3 


An element T of trifocal space has the following form in Pliicker coordinates: 


T = J2 T iMk,i} e i A e ? A e fc A ef 

k<l 


e V x ®V 2 ®r\Vz = v x ®v 2 ®v 3 *. 


A trifocal tensor is gotten by applying T to A. In particular, T(A) may be thought of as a 
vector whose coordinates are determinants of the 4x4 submatrices of A obtained by taking 
one column from each of the first two blocks of A and two columns of from the third block 
of A. 


Lemma 3.3. The locus of trifocal tensors is the GL(3) x GL(3) x GL(3 )-orbit of the 9- 
dimensional linear space whose coordinates are given by TA,y,{fc,z} satisfying the following con¬ 
ditions: 

Ti,i,{k,l}(A ) = —X\ j + X 2) i \ 

Ti t k,{i,i}{A ) = —X \+ x :i/l > k and l distinct, and i distinct from k,l , 

(-T) X 2 i X3 j, J 
Tij,{ k ,i}(A) = 0 else. 

3.8. The quadrifocal variety. For a quadruple of cameras, we consider the projection of 
Gr(4,12) = Gr(4, V x © V 2 © V 3 © Vf) C P/\ 4 (Vj © V 2 © V 3 © V 4 ) = p( 4 ) _1 to quadrifocal space 
F(V 1 ®V 2 ®V 3 ®V 4 ) = P 80 - The target space may be interpreted as the projectivization of a 
space of3x3x3x3 tensors, and caries the natural action of GL(Vi) x GL(V 2 ) x GL(Vs) x 
GL(V 4 ). The image of the projection is called the quadrifocal variety, which is also naturally 
GL(Vi) x GL(V 2 ) x GL(V 3 ) x GL(V 4 )-invariant. Because the vector spaces Vj., V 2 , V 3 and V 4 
play symmetric roles, this image is also naturally © 4 invariant. 

The quadrifocal variety is 29-dimensional (|22j p. 423]), and the GIT quotient Gr(4,12)//(C *) 4 
is 29-dimensional, and thus birational to the quadrifocal variety. In this case, the 4- 
dimensional torus acts by a weight of ( 1 , 1 , 1 , 1 ). 

An element Q of quadrifocal space has the following form in Pliicker coordinates: 

Q = ^2 Qid,k,i e i A e) A e l A ef e Vi®V j ®V k ®Vi. 

A quadrifocal tensor is gotten by applying Q to a blocked 4 x 12 camera matrix A. In 
particular, Q(A) may be thought of as a vector whose coordinates are determinants of the 
4x4 submatrices of A obtained by taking one column from each of the 4 blocks of A. 

For the quadrifocal tensor, consider 4 cameras in special position. 
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Lemma 3.4. The locus of quadrifocal tensors is the GL(3) x GL(3) x GL(3) x GL(3 )-orbit 
of the 12-dimensional linear space of tensors whose coordinates are given by Qij,k,i satisfying 
the following conditions: 


Qi,i,k,l{A) ( 1) (*G,i 3*2,i) 

Qi,k,i,l(A) ( 1) *^3 ,i) 

Qi,k,l,i(A) ( l) i 3^4, i) 
Qk,i,i,l[Al) ( 1) (^2 ,i *^3 ,i) 

Qk,i,l,i(A) ( 1 )' (%2,i *^4 ,i) 

Qk , l , i , i (. A ') ( 1) (*^3,i *^4 , i ) 

Qi,j,k,l(A ) = 0 


Qi,i 

,l,k(A), 

Qi,l. 

,i,k{A), 

Qi,l. 


Qm ; 

,i,k(A), 

Ql,i. 

,k,i(A ), 

Ql,k 

,i,i{A) 


k < l and i distinct from k, l, 


else. 


4. Arbitrary dimensional cameras: a connection to principal minors 

It is curious to study the case when W and V t arbitrary dimensions (see 0 )- For instance, 
when each V t has dimension two, we might imagine the cameras to be flat landers’ cameras. 
When the V* have dimension more than 3, we might consider an image plane that records 
higher dimensional data such as temperature or color. 

4.1. Principal minors. In the case V = C 2 we find it very interesting that seemingly 
unrelated work regarding principal minors fits into this framework [23, 33], [10], [39, 38], [18], 19j. 

Suppose W has dimension m and that we have m copies of V t , each with dimension 2. 
Consider again the matrix M — ( Aj | AJ | ... | Af ), now as a m x 2 m matrix consisting 
of m blocks of size m x 2. By a left action of GL(W) we may assume that each block of 
M is of the form Aj = (e* bfj, where e* is the i-th standard basis vector of W and bi is 
arbitrary. Let B denote the m x m matrix with columns bi. 

It is straightforward to see that the maximal minors of M correspond to the minors of 
B and moreover that the minors of M that use precisely one column from each block of M 
correspond to the principal minors of B. 

In turn, this identification naturally gives the space Vi® ... ®V m the interpretation as the 
space of all 2 m principal minors of an m x m matrix (the 0 x 0 minor may be assumed to be 
1 in this construction). 

The case m — 4 was studied by [HU S3j, who discovered the minimal defining equations of 
the ideal of relations among principal minors of a generic 4x4 matrix, and found that the 
algebraic variety coincides with the main component of the singular locus of the 2 x 2 x 2 x 2 
hyperdeterminant. 

Remark 4.1. Weyman and Zelevinski [51] analyzed the singular locus of the hyperdetermi¬ 
nant. In the 2 x 2 x 2 x 2 case, there are 8 components of the singular locus: 

^cusp U Vnode (0) U V node ({iJ}) 1<*<J<4. 

Moreover, according to Holweck, [24] the “main” component V no d e (0) also has the interpre¬ 
tation as the projective dual of the secant line variety: 

V node (0) = ^(P 1 x P 1 X P 1 X P 1 ) V C F{V 1 ®V 2 ®V 3 ®V 4 )*. 

The variety of principal minors of 4 x 4 matrices is the projection 

Gr(4, Vi © V 2 © V 3 © V4) —> P( V4). 

The dual variety of Gr(4, 8 ) is not one of the 3 cases which is normal [24], however it does 
have many other nice properties, such as a finitely generated invariant ring, finitely many 
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orbits in the null cone, and a set of normal forms that depend on 8 parameters, |12j . It 
would be nice to see how to exploit these coincidences. 

In the case that the matrix B is assumed to be symmetric, [23] re-invigorated a classical 
study going back to Cayley of relations amongst principal minors of symmetric matrices. 
(See [33] and [39] for a historical discussion.) In particular, the ideal of relations of the 
principal minors of a generic symmetric 3x3 matrix is generated by a single equation 
known as Cayley’s 2x2x2 hyperdeterminant, denoted h for this discussion. In the 4x4 
case Holtz and Sturmfels discovered that the © 4 x J ][ i=1 4 SL(V))-orbit of h generates the 
ideal of relations, and conjectured that a similar orbit of equations, which they called the 
hyperdeterminantal module, generates the ideal of relations among principal minors of a 
symmetric matrix. This conjecture was solved (set-theoretically) by the author [391138] . We 
note that in the framework of this paper, imposing that the matrix B be symmetric naturally 
imposes a restriction from the Grassmannian Gr(m, 2m) to the Lagrangian Grassmannian 
LGr(m, 2 m), (see [38]). 

In the case that the matrix B is assumed to be skew-symmetric, one considers the principal 
Pfaffians. It turns out that the relations among principal Pfaffians are precisely the equations 
of the orthogonal Grassmannian, which are analogous to the Pliicker relations and were 
known classically, (see [29] for a modern treatment in invariant language). 

4.2. Higher dimensional images. It would be interesting to consider, for instance, a 
camera with fixed position continuously viewing a scene in time in our paradigm as IP a 
5-dimensional vector space and V) each as 4-dimensional vector spaces (3 space dimensions 
and one time). This seems to be a promising approach to understanding this and a wide 
variety of generalizations. 

5. Contractions and homography tensors 

According to Shashua and Wolf Ed a contraction of a quadrifocal tensor is a homography 
tensor [46l| in the other 3 views. Moreover, this property defines the set of homography 
tensors. Similarly, the contraction of a homography tensor is an LLC mapping (a rank 2 
matrix relating the epipoles in two views). We will give an invariant description of this 
process and show how to get 600 independent internal constraints on quadrifocal tensors. 

Consider Vi®V- 2 ®V 3 ®Vi and suppose we choose a basis of V 4 , { x,y,z }. Then we may 
write Q £ V\®V- 2 <%)V 3 &Va as 

Q(x, y, z ) = xQ 1 + yQ 2 + zQ 3 , 

where Qi are the standard 3x3x3 slices of the 3 x 3 x 3 x 3 tensor. By abuse of notation, we 
also think of Q(x,y,z ) as a function from C 3 to P 1 GV 2 GV 3 , and let x,y,z act as variables. 

The following results come from m and [46]. 

Theorem 5.1 ( [471 Theorem 2]). If Q is a quadrifocal tensor then Q(x, y, z) is a homography 
tensor for every value of x, y, z. 

For our purposes, the above result can be used to define the notion of “homography 
tensor.” Now choose a basis a, b , c of V 3 . If H £ Vi®V 2 ®V 3 , then we may write H(a, b , c) = 
aH 1 + bH -2 + cH 3 . In slightly different language [4J Prop. 7.2] we showed that the Zariski 
closure of the homography tensors is an irreducible variety (we called it P-Rank 2 ’ 2 ’ 2 ), defined 
(at least set-theoretically) by 30 cubic equations. 
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Theorem 5.2 ([47, Theorem 1]). If H is a homography tensor then H(a,b,c ) is an LLC 
(Linear Line Complex) mapping for every value ofa,b,c. 

For our purposes, an LLC is a skew-symmetric 3x3 matrix, which necessarily has (even) 
rank < 2. Thus for all values of a, b, c the matrix H(a, b , c) must satisfy the constraint 
det (H(a,b,c)) = 0. In particular, every coefficient in the cubic polynomial in a,b,c must 
vanish. This condition gives a basis of the ten-dimensional space of cubics. Note, the coeffi¬ 
cient of a 3 is the determinant of the first slice, and the coefficients on b 3 and c 3 are respectively 
the determinants of the second and third slices. This space also has the interpretation of 
A^i'ffiA 3 ^*®^ 3 !^* as a G-module. 

We can apply the same method to quadrifocal tensors. Namely if Q is a quadrifocal tensor, 
Q(x,y,z ) must satisfy all of the internal trifocal constraints for all values of x,y,z. In par¬ 
ticular, Q(x,y, z)(a,b,c) must be an epipolar matrix (whose entries are bi-homogeneous 
quadrics), and we must have the bi-homogeneous sextic polynomial (of bi-degree (3,3)) 
det(Q(;c, y, z)(a, b, c)) = 0 for all values of x, y, z, a, b , c. 

The space of bi-degree (3,3) sextics is 100-dimensional, and the coefficients of the expres¬ 
sion det(Q(x,y, z)(a,b,c)) provide a basis of a 100 -dimensional space of cubics that vanish 
on the quadrifocal variety. Note, the coefficient of x 3 a 3 is the determinant of the first slice, 
and there are 8 other monomials that are the product of two cubes, the coefficients of which 
correspond to the determinants of the 8 other slices. This space also has the interpretation 
as the GL(3) x4 -module A 5 kd®A ? k 2 ® 5 ' 3 V 3 < 8 )S' 3 V 4 . 

If we interchange the roles of Vi,V 2 ,V 3 ,V 4 , and apply the same construction we obtain 
6 non-isomorphic modules of the same format /\’Vi®/\)Vj®S 3 Vk®S 3 Vi. In particular, we 
find a space of 600 cubic polynomials in the ideal of the quadrifocal variety, 54 of which are 
determinants of 3 x 3 slices ofa3x3x3x3 tensor. 

Let G := 64 K GL(3) x4 . Since the quadrifocal variety is G- invariant, we can describe 
its ideal as a G module. For convenience, when the © 4 symmetry is present, we write 
S 7ll S 7T2 S 7r:i S 7r4 for the direct sum of S 7T1 Vf®S W2 Vf®S 773 Vf®Si r4 Vf and all non-redundant 
copies of it obtained by permuting the indices. 

The above discussion implies the following: 

Proposition 5.3. Suppose Q is a quadrifocal tensor. Then the 600 polynomials forming a 
basis of the G-module vanish on Q. 

Because one of the contractions of a trifocal tensor also form a homography, we know 
that 10 cubic polynomials vanish on the set of trifocal tensors. I 11 [4] we showed that this 
condition cuts out a subset of the 3x3x3 tensors consisting of 4 irreducible algebraic 
varieties. In order to distinguish the trifocal variety, more equations are needed. 

Theorem 5.4 ( [4j). The ideal of the trifocal variety in V\®V 2 ®Vf is generated by 10 cubic, 
81 quintic, and 1980 sextic polynomials. These are represented by the following GL(3 ) x3 
modules: 

M 3 = f\V{®tfV 2 *®S 3 V 3 , 

M.§ = 5*221 k / j*®*S , 22lk2 l! <8)5 , 31lV3 © 5221 Gi*ffi5221 V 2*©5 , 22lF3, 

MIq = 5 4 llP 1 *®5 3 3'F2*®5222V r 3 © 

® 5 2 22 L 4 *® 5 3 3 Id,* ®5 4 h V 3 ® 

ffi S 2 22 Vf®SssVf®SssVs ffi 


SssVf^nV^S^Vs ffi 5331 7^)5222 k / 2 *<8)5 4 iiV3 
Ss3V 1 *^SssV 2 *^S 222 V 3 ffi SssV(®S 222 Vf®SssVs 
SszVf®Ss 2 iVf®Ss2iVs ffi Ss 2 iVf®SssVf®Ss 2 iVs. 
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graded 

piece 

dim I d 

necessary G modules 
of minimal generators 

dimension of 
necessary mingens 

h 

0 

M 2 = 

0 

0 

h 

600 

m 3 = 

S 3 S 3 S h 1 , 151 , i,i 

600 

h 

48,600 

m 4 = 

0 

0 

h 

1,993,977 

m 5 = 

<S , 3,1,1‘5'3,1,1‘5'3,1,1<S'3,1,1 

®*S f 2,2,l‘S , 2,2,l*S , 2,2,l*S f 2,2,l 

1,377 

h 

54,890,407 

m 6 = 

'S , 4,l,l'S , 3,3'S , 2,2,2»S'2,2,2 ( 8lC 2 

©>5'3,3>S , 3,3>S'2,2,2*S , 2,2,2G)C 2 

T5 *5*3,2,1 *S*3,2,1 *5*2,2,2 *5*2,2,2 

ffi'S2,2,2 l S , 2,2,2'S , 2,2,2'S , 2,2,2®C' 2 

©'S , 6*S , 3,3*S , 3,3>S , 2,2,2 

37,586 

h 

1,140,730,128 

m 7 = 

0 

0 

h 

18,940,147,947 

M 8 = 

S , 4 i 4S , 4 j 4S'4 ) 4S , 4 ) 2,2 ® C 2 

162,000 

h 

> 223,072,284,455 

M 9 > 

S < 5 ! 4>S , 5 i 4>S , 5 ! 4>S , 4 ] 3 i 2 

> 3,087,000 


TABLE 3. The dimension and isotypic description of the ideal of the quadri- 
focal variety up to degree 9. 


In the next section we work to obtain a similar statement for quadrifocal tensors. 

6. Computational results for the quadrifocal ideal 

The goal of this section is to give a description of the lowest degree part of the vanishing 
ideal for the quadrifocal variety in terms of G- modules. First recall that the polynomial ring 
S d (Vf®Vf®Vf®Vf ) has a graded isotopic decomposition (see [30], for instance) with 
respect to G = ©4 K GL(3) x4 : 

S d (V l *®V 2 *®V 3 *®V£) = 

7r 

with Schur modules S n V* := S ni V*®S n2 V 2 *®S na V 3 ®S V4 V 4 * and multiplicity space C m?r . The 
multiplicity space C mn has a basis given by linear combinations of fillings of shape n. We 
write (S' 7 ri 5' 7 r 2 5' 7 r 3 S' 7 r 4 )<2)C m to indicate the G module (occurring with multiplicity m) gotten 
by summing over all non-redundant permutations of the partitions indexing S n V*. 

Computation 6.1. Let I d denote the degree d piece of the ideal of the quadrifocal variety, 
and let G = & 4 x GL(3) x4 . The symmetry assisted computations of I d up to degree d < 9 
using random points of the quadrifocal variety are reported in Table 0 For 5 < d < 9 the 
dimensions of Id and the list necessary modules of minimal generators in Tableware correct 
with high probability. 

Description of computation: The qualifier “with high probability” refers to the fact that we 
computed the ideal on random subsets of points from the quadrifocal variety, so in principle 
we could have chosen a set of points in special position and would have over counted the 
dimension of the ideal. The set of points in such special position, however, being of lower 
dimension, has measure zero in the quadrifocal variety, so we say the results hold with high 
probability. The proof techniques we use for the degree < 4 computations, however are valid 
unconditionally. 
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The content of this computation is two-fold. First we computed (in Maple) the entire 
ideal degree by degree, up to degree 8 and partially in degree 9, making use of the isotypic 
decomposition of the polynomial ring. Second, we checked for representation-theoretic cer¬ 
tificates for necessity of minimal generators using SchurRings in Macaulay2. In the ancillary 
hies associated with the arXiv version of this manuscript we provide a minimal set of data 
necessary to check our work. This includes basic maple scripts, fillings that yield a basis of 
each isotypic component, and the resulting modules in the ideal. Here is a summary of these 
computations. 

Every irreducible G'-module has the property that it is the vector space spanned by the 
G-orbit of a so-called highest weight vector (see [28]). We used Young symmetrizers to obtain 
a basis of each multiplicity space C mn in the isotypic decomposition of the polynomial ring 
above. Then we computed the subspace of C m,r that vanished on the quadrifocal tensors. 
This algorithm was used and outlined in [4] [8] for example. Since we learned this algorithm 
from the paper of Landsberg and Manivel [30], we call this algorithm the Landsberg-Manivel- 
algorithm or the LM-algorithm for short. 

The output of this symmetry-enhanced polynomial interpolation computation is a G- 
modulc description of the ideal in each degree. The dimensions of these modules give the 
beginning of the Hilbert function of the quadrifocal ideal, and are also reported in Table [3] 
We will use notation of SchurRings: The ring Sym*(V* <8)14*) is regarded as a 
tower of rings. The variables in each ring are represented as s^, (respectively t n , u 7r , v n ), for 
partitions i r. The correspondence between the two notations is 

ms^t^u^v^ ^ S^Vi^S^V^S W3 H 3 * ®S ni V£®C m . 

In degree 3 our application of the LM-algorithm found the following modules: 

h = ( s (l,1,1)^(1,1,1)^3 + (s(l,1,1^3 + s 3t(l, 1,1) ) u (l, 1,1) ) u 3 + (( s (l,l, 1)^3 + s 3^(l, 1,1) )«3 + S3t3^(l, 1,1) )^(1,1,1)- 

To save space, we only record those modules up to the © 4 -action: 

Ms := 1 )^( 1 , 1 , 1 ) 

This module corresponds to the same 600 polynomials in Proposition 15.31 Non-vanishing of 
polynomials on random points of a variety implies non-vanishing for the entire variety, so 
this 600 dimensional vector space of cubics are the only cubics vanishing on the quadrifocal 
variety. Therefore, the d < 3 computations hold with no “high probability” qualifier. 

The modules we found (by the LM-algorithm) in degree 4 are (up to the ©4-action): 

I4 = (^G + (s 4 + S( 3 , 1 ))£( 3 , 1 ))W( 2 , 1 ,1)^(2,1,1) 

Because 81 * 600 = 48600, we guess that all the equations in J 4 come from linear combi¬ 
nations of products of linear forms with the 600 cubics in J 3 and thus we would guess that 
there are no new generators in degree 4. We would like to prove this using representation 
theory. 

The multiplication in the ring S ,, (H 1 *<g)H 2 *<g)V^*<g)V2' t ) is just the usual polynomial multi¬ 
plication. It is not easy to determine the isotopic version of multiplication. However, we 
can get a lower bound on the modules of minimal generators in our ideal. The basic idea 
is the following. The multiplication in the ring S* (V* <8)14*£314*) is also the restriction 
of the multiplication in the tensor ring (V*<S)V£<S)V£<8V £)®. The tensor product of two 
representations of the form S n V* is obtained by an iteration of the Littlewood-Richardson 
rule. 
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Using the package SchurRings [38] we computed the tensor product 

h®(y*®v;®v*®vZ) 

and found that it exactly coincides with J 4 . This is an indication that there are prob¬ 
ably no new generators in degree 4, however it could be that the modules resulting in 
4 . 3 ® U* G V -2 ®Vf®>V 4 * are not actually in The only way that there 

could be new minimal generators in degree 4 is if there were already some syzygies amongst 
the degree 3 equations. At least in degree 4 it is possible to look at the highest weight vectors 
and check that they are in the ideal generated by the 600 cubics in Macaulay2. On the other 
hand, we can argue in a less computationally intensive way by using the following special 
case of a more general idea from [45], which was employed in [44J [40]. 


Lemma 6.2. Suppose F n := (F ni , F n2 , F ns , F n4 ) is a filling using the ordered alphabets 
(*4i, *42, * 43 , * 44 ) giving a nonzero realization of the module S n V* in S d {yf®Vf®Vf®Vf). 
If Fp := (F /n , F /t2 , F /t3 , F im ) is a filling giving a nonzero realization of the module S t ,V* in 
S d ~ l (Vf® V 2 *®V :j *® V 4 *) and F^ may be obtained from F n by respectively deleting the last used 
letter in each of the alphabets (*4i, *4 2 , *4 3 , *4 4 ), then S n V* is in the ideal generated by S^V*. 


Now we apply Lemma [(Ol to see that is in the ideal generated by SniSmS 3 S 3 . 

In this case everything occurs with multiplicity one, so our work is much easier. The filling 


2 3 


2 3 


produces a realization of a copy of SASsiSAiSm in the ideal of the quadrifocal variety. 
Notice that by removing the fourth letter we obtain the filling 


1 


1 









2 


2 









3 


3 


1 

2 

3 

® 

1 

2 

3 


which produces a nonzero copy of S'liiS'mS'sS's- 

The same argument may be applied to S 4 S 4 S 211 S 2 11 , with the same result. Therefore, none 
of the modules in degree 4 are minimal generators of the ideal of the quadrifocal variety. 

In degree 5 we note that 1929501 — 600 * 3321 = 1,377. This means that the degree 
3 equations cannot generate all of the ideal in degree 5, and there must be at least 1, 377 
new minimal generators in degree 5. Moreover, if there are no degree 2 syzygies amongst the 
degree 3 equations then the space of minimal generators in degree 5 would be precisely 1, 377- 
dimensional. In principle one could try to use a degree-limited Grobner basis computation 
in Macaulay2 to check if each highest weight vector of each of the modules in J 5 is actually 
in ( 13 ) 5 , but memory limitations become a problem. 

However, we can argue by comparing multiplicities of isotypic components. In degree 5 
the LM-algorithm produced the following modules (up to the © 4 -action): 

h = (55^5 + (^5 + 2S(4 ) l))t(4 i l) + (*5 + S(4,l) + S(3,2) )^(3,2) + (35(4,1) + 7S( 3i1]1 ) )t( 3;1 4) ^(3,1.,!) 

+ ( ((^5 + 2-S(4 ! i))t(4 j i) + S( 4il )t( 3]2 ) + (2*5 + 2*(4,l) + 2*^2))*(3,1,1)) u (3,l,l) 

+ ('S5G + (S5+2S(4,1))£(4,1) + (' S 5+S(4,l)+' S (3,2))£(3,2) + (' S (4,1)+' S (3,1,1))^(3,1,1)+ S (2,2,l) i (2,2,l))' u (2,2,l) )^(2,2,1) - 
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Using SchurRings we found that the only modules left in the difference between I 5 and 

h < 8 > are 

A4s '■= s (3,1,1)^(3,1,1)^(3,!,1)^(3,1,1) + 5(2,2,1)^(2,2,lW(2,2,1)^(2,2,1)- 

From this computation we know that these two summands must be among the minimal 
generators. We do not know if there are any other minimal generators in degree 5. 

Remark 6.3. While this ShurRings computation ignores the structure of the multiplicity 
spaces in the ideals, to first approximation it tells if there is any representation-theoretic 
reason for modules to be among the minimal generators. 

In degree 6 the LM-algorithm produced the following modules (up to the © 4 -action) J 6 = 

(•S6t6 + (s6 + 2S( 5]1 ))t(5 !l )-|-(s6 + 2S(5 il )-l-2.S(4 ! 2))t(4,2) + (3S(5 l l)+3S(4 ] 2) + 10S(4 ,l,l))t(4,l, i) ) ^(4, i, i) ^ (4, i, i) 
+ ((56 + 5(5,1) + S(4,2))£(4,l, 1)^(4,1,1) + 5(4,i ! i)f(4 ! i,i)tt(3 ! 3))r’(3 ! 3) 

+ X( s 6 + 3S(5 ! l))t(5 j l) + (S6 + 3S(5 ,i) + 35( 4 , 2 ) )t(4,2) + (2«6 + 7S( 5 ,i) + 7S( 4 , 2 ) + 125(4,1,1) )^(4,1,1) ^(4,1,1) 

+ (5(5,1) + 5(4,2) + 2S(4,i,i))f( 4 ,i,i)U(3, 3 ) 

+ (56^6 + (256 + 65(5,1) )f( 5,1) + (2s 6 + 65(5,1) + 6S( 4j2 ) )t(4,2) + (356 + 105( 5 ,i) + 105( 4 , 2 ) + 16S( 4 ,i,i) )t(4,l,l) 

+ ( s 6 + 2S( 5 ,i) + 2S(4, 2 ) + 45(4,i,i) + s (3,3) ) ^(3,3) 

+ (3S6 + 125(5,1) + 125(4,2) + 16S(4,i,i) + 3S( 3 , 3 ) + 24S( 3 ,2,l) )t(3,2,1))H(3,2,1) )' y (3,2,l) 

+ ( s (5,1)7(5, 1 ) + (5(5,1) + 5(4,2))t(4,2) + ( s 6 + 3S(5,i) + 3S( 4 , 2 ) + 65 ( 4 , 1 , 1 ) )t (4,1,1) ) u (4, 1 , 1 ) 

+ (5(4,1,1)^(4,1,1) + 56t(3,3))u(3,3) + (( s 6 + 3S( 5 ,i))t( 5 ,i) + (S 6 + 3S( 5 ,i) + 3S( 4 , 2 ) )t(4,2) 

+ (56 + 45(5,1) + 4S( 4 ,2) + 5S(4,i,i))t( 4 ,i,i) + (S(5,i) + 5( 4 , 2 ) + 5(4,1,1) )t(3,3) 

+ (56 + 4S(5,i) + 4s(4, 2 ) + 6S(4,i,i) + 3S( 3 , 3 ) + 8S( 3 , 2 ,l) )t (3,2,1) ) u (3, 2,1) 

+ (sete + 2s( 5 ,i)f ( 5 , 1 ) + (s6 + 5 ( 5 , 1 ) + 2s( 4 , 2 ))t(4,2) 

+ (25(5,1) + 2S(4,2) + 2 s( 4 ,i,i))f( 4 ,i,i) + (S( 5 ,i) + 3S( 4 ,i,i) + 3 5( 3 , 3 ) )t( 3 , 3 ) 

+ ( s (5,l) +5(4,2) +5(4,1,!) + 4s( 3 , 2 ,l) )t(3,2,l) + 3S( 2 ,2,2)^(2,2,2))ll(2,2,2) ) v ( 2 , 2 , 2 ) ■ 
Applying the same SchurRings test we find the following necessary minimal generators: 

Me := 56i(3,3)tl(3,3) + ((2S(4,1,1) + 2S(3,3))t(3, 3 ) + S( 3 , 2 ,l)t(3,2,l) + 2S( 2 ,2,2)£(2,2,2))' U (2,2,2) +(2,2,2) ■ 

These modules could not come from any of the lower degree parts of the ideal, so they are 
among the minimal generators in degree 6 , however there could be more minimal generators 
that we haven’t accounted for if there were syzygies among the lower degree generators. 

The space of degree 7 polynomials 5 , 7 (U 1 *< 8 )V r 2 *( 8 )V^*( 8 )l 4 *) decomposes as a sum of 288 
isotypic components (up to ©4 symmetry) with dimension of multiplicity spaces as large 
as 301. Thus applying the LM-algorithm directly in degree 7 was more challenging and we 
were forced to exploit a parallelism, following a process similar to that in [4T]. We ran the 
LM-algorithm in a separate instance of Maple for each of the 288 isotypic components. This 
allowed us to distribute the computation. Running in parallel the degree 7 computation 
took approximately 12 hours on our two servers with respectively 40 and 24 processors. We 
found 201 modules occurring non-trivially the ideal. The SchurRings test yielded no new 
necessary minimal generators. 

The parallelized LM-algorithm for degree 8 involved 619 Schur modules with multiplicity 
spaces up to 608-dimensional. Parts of this computation took approximately 20 days on our 
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servers. We found 453 modules occurring in J 8 with nonzero multiplicity. The ShurRings 
test only produced one module occurring in / 8 with greater multiplicity than can occur in 
I 7 ®Vi W 2 W 3 W 4 *, namely 

_M 8 := 2S(4 j 4)i(4 j 4)lt(4 ) 4)U(4 ) 2,2)- 

That this module is not in the ideal generated by M. 3 +M. 5 +M.§ can be seen for shape reasons 
alone. The Schur module A4 8 is indexed by a multi-partition containing three partitions of 
shape (4,4), but no module the list of known minimal generators is indexed by quadruple of 
partitions with a subset of 3 of them that each fit in a box of dimensions 2x4. Therefore 
A4 8 must occur with multiplicity 2 in the minimal generators in the ideal. 

In degree 9 there are 1205 isotypic components, with multiplicity spaces as high as 2226- 
dimensional. After approximately one month of computational time on our servers we were 
able to evaluate 1158 of these isotypic components on the quadrifocal variety, 951 of which 
occurred non-trivially in the ideal. The computations that finished were those modules for 
which the multiplicity was low (under around 300). 

Remark 6.4. The main obstruction to completing the degree 9 computation is a lack of 
available machines able to perform Maple computations. More specifically, if a given isotypic 
component has multiplicity m in the polynomial ring, we must populate an m x m matrix 
with m 2 evaluations (from the Young symmetrizer algorithm), once to verify we have fo un d 
a basis of the space, and a second time to evaluate this basis on points of the quadrifocal 
variety. Each of these evaluations could be done in a separate instance. So in the case 
of multiplicity 2226, we must perform approximately 10 million evaluations, which could 
potentially be done on a separate processor if were to take fuller advantage of the parallel 
nature of this problem. One of the largest multiplicity spaces in degree 9 we were able to 
handle was 456 dimensional, and the computation in multiplicity 2226 case is approximately 
23 times larger than this. Since this scale of resources is not currently available to us, it 
seems unlikely that we will be able to complete the degree 9 computation. 

Though we were only able to partially compute Jg, we were able to obtain two new modules 
in degree 9 that must be among the minimal generators of the quadrifocal ideal: 

■M-9 '■= s (5,4)f(5,4)W(5,4)U(4 i 3 ! 2) + S(5,4)^(5,4)^(5,4)^(5,2,2)■ 

These two modules occur with multiplicity 3 in J 9 , but instances of these modules coming 
from / 8 Y Vj* ®Y 2 * 00V 4 * occurred with multiplicity 2. Therefore, these modules must occur 
with multiplicity at least 1 among the minimal generators of I. 

The dimension counts in Table [3]are a straightforward application of a hook length formula, 
which is also implemented in SchurRings for example. 

We wondered if Table [3] might be a complete list of minimal generators for the quadrifocal 
ideal. We performed further experiments with modules in degrees 9, 10, and 11 having low 
multiplicity. We found that S 5t5 S 5t5 S 5t5 S 4 t3t3 occurred with multiplicity one greater in J 10 
than what could come from the part of J 9 that we were able to compute. This was the only 
new module we found in low multiplicity in degree 10. Further experiments suggest that 
there may be many more new modules of minimal generators with low multiplicity in degree 
11. At present we are not confident enough to form a conjecture as to what happens in 
higher degree, nor in which degree the last set of minimal generators must occur. 

Remark 6.5. In a previous draft we had incorrectly computed the number of equations in 
the ideal because of an unfortunate programming error. 
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